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Abstract 

The paper is organized as a self-contained literate Haskell 
program that implements elements of an executable fi- 
nite set theory with focus on combinatorial generation and 
arithmetic encodings. The code, tested under GHC 6.6.1, 



is available at http://iogic.csci.unt.edu/tarau/ 
^esearch/2008/f SET . zip 

We introduce ranking and unranking functions generaliz- 
ing Ackermann's encoding to the universe of Hereditarily Fi- 
nite Sets with Urelements. Then we build a lazy enumerator 
for Hereditarily Finite Sets with Urelements that matches the 
unranking function provided by the inverse of Ackermann's 
encoding and we describe functors between them resulting 
in arithmetic encodings for powersets, hypergraphs, ordinals 
and choice functions. After implementing a digraph repre- 
sentation of Hereditarily Finite Sets we define decoration 
functions that can recover well-founded sets from encodings 
of their associated acyclic digraphs. We conclude with an en- 
coding of arbitrary digraphs and discuss a concept of duality 
induced by the set membership relation. 

Keywords hereditarily finite sets, ranking and unrank- 
ing functions, executable set theory, arithmetic encodings, 
Haskell data representations, functional programming and 
computational mathematics 

1. Introduction 

While the Universe of Hereditarily Finite Sets is best known 
as a model of the Zermelo-Fraenkel Set theory with the 
Axiom of Infinity replaced by its negation (! Takahashi|1976 



Meir et al.|1983j ), it has been the object of renewed practical 
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interest in various fields, from representing structured data 
in databases ( Leontjev and Sazonov|2000^ to reasoning with 
sets and set constraints in a Logic Programming framework 



( Dovier et al.|2000[ [Piazza and Pohcriti|2004HDovier et al.| 
|200T] i. 

The Universe of Hereditarily Finite Sets is built from the 
empty set (or a set of Urelements) by successively applying 
powerset and set union operations. A surprising bijection. 



discovered by WiUielm Ackerman n in 1937 (Ackermann 
[T9371 [Abian and Lamacchia|[T978| |Kaye and Wong||2007| l 
from Hereditarily Finite Sets to Natural Numbers, was the 
original trigger for our work on building in a mathematically 
elegant programming language, a concise and executable 
hereditarily finite set theory. The arbitrary size of the data 
objects brought in the need for arbitrary length integers. 
The focus on potentially infinite enumerations brought in the 
need for lazy evaluation. These have made Haskell a natural 
choice. 

We will describe our constructs in a subset of Haskell 
( Peyton Jones|2002 2003a|b| l seen as a concrete syntax for 
a generic lambda calculus based functional languag^ 

We will only make the assumptions that non-strict func- 
tions (higher order included), with call-by-need evaluation 
and arbitrary length integers are available in the language. 
While our code will conform Haskell's type system, we wiU 
do that without any type declarations, by ensuring that the 
types of our functions are all inferred. This increases chances 
that the code can be ported, through simple syntax transfor- 
mations, to any programming language that implements our 
basic assumptions. 

The paper is organized as follows: section |2] introduces 
the reader to combinatorial generation with help of a bit- 
string example, section [3] introduces Ackermann's encod- 



' As a courtesy to the reader wondering about the title, the author confesses 
being a hitchhiker in the world of functional programming, coming from 
the not so distant galaxy of logic programming but still confused by recent 
hitchhiking trips in the exotic worlds of logic synthesis, foundations of 
mathematics, natural language processing, conversational agents and virtual 
reality. And not being afraid to go boldly where ... a few others have already 
been before. 



ing in the more general case when urelements are present 
and shows an encoding for hypergraphs as a particular case. 
Section |4] gives examples of transporting common set and 
natural number operations from one side to the other. After 
discussing some classic pairing functions, section |5] intro- 
duces new pairing/unpairing on natural numbers. Section |6] 
discusses graph representations and decoration functions on 
Hereditarily Finite Sets (6.1 1, and provides encodings for di- 



rected acyclic graphs (6.3 1. Sections |7] and |8] discuss related 
work, future work and conclusions. 

2. What's in a Bit? 

Let us observe first that the well known bitstring representa- 
tion of Natural Numbers (see to_rbits and from_rbit in 
Appendix and notice the reversed bit order) is a first hint at 
their genuinely polymorphic, "shapeshifting" nature: 

to_rbits 2008 

[0,0,0,1,1,0,1,1,1,1,1] 
f rom_rbits [0,0,0,1,1,0,1,1,1,1,1] 

2008 

The effect is trivial here - these transformers turn a number 
into a list of bits and back. One step further, we will now 
define two one argument /Mncf/oni, that implement the "bits" 
o and i: 

X = 2*x+0 

1 X = 2*x+l 

One can recognize now that 2008 is just the result of 
composing "bits", with a result similar to the result of 
f rom_rbits; 

(o . o . o . i . i . o . i . i . i . i . i) 

2008 

The reader will notice that we have just "shapeshifted" to yet 
another view: a number is now a composition of bits, seen 
as transformers, where each bit does its share by leftshifting 
the string one position and then adding its contribution to 
it. Note the analogy with Church numerals, which represent 
numbers as iterations of function application, except that 
here n will only need 0(log2{n)) of space. 

Like with the usual bitstring representation, the dominant 
digit is always 1, zeros after that have no effect, from where 
we can infer that the mapping between such bitstrings and 
numbers is not one-to-one. A variant of the 2-adic bijective 
numeral representation fixes this, and shows one of the sim- 
plest bijective mappings from natural numbers to bitstrings 
(i.e. the regular language {0, 1}*): 

nat2bits = drop_last . to_rbits . succ where 
drop_last bs=genericTake (1-1) bs where 
l=genericLength bs 

bits2nat bs = (from_rbits (bs -\-<r [1]))-1 
nat2bits 42 



[1,1,0,1,0] 
bits2nat it 
42 

map nat2bits [0..15] 

[[] , [0] , [1] , [0,0] , [1,0] , [0,1] , [1,1] , 
[0,0,0] , [1,0,0] , [0,1,0] , [1,1,0] , 
[0,0,1] , [1,0,1] , [0,1,1] , [1,1,1] , 
[0,0,0,0]] 

The last example suggests that we are now able to generate 
the infinite stream of all possible bitstrings simply as as: 

all_bitstrings — map nat2bits [0. .] 

We will now hitchhike with this design pattern in our tool- 
box to a more interesting universe. 

3. Hereditarily Finite Sets and the 
Ackermann Encoding 

The Universe of Hereditarily Finite Sets (HFS) is built from 
the empty set (or a set of Urelements) by successively ap- 
plying powerset and set union operations. Assuming HFS 
extended with Urelements (i.e. objects not having any el- 
ements), the following data type defines a recursive "rose 
tree" for Hereditarily Finite Sets: 

data HFS t = U t | S [HFS t] deriving (Show, Eq) 

We will assume that Urelements are represented as Natural 
Numbers in [0. .ulimit-1]. The constructor U t marks 
Urelements of type t (usually the arbitrary length Integer 
type in Haskell) and the constructor S marks a list of recur- 
sively built HFS type elements. Note that if no elements 
are used with the U constructor, we obtain the "pure" HFS 
universe by representing the empty set as S [] . 

3.1 Ackermann's Encoding 

A surprising bijection, discovered by WiUielm Ackermann in 



1937 (jAckermann|1937 Abian and Lamacchia|1978" Kaye 



and Wong|r2007[ ) maps Hereditarily Finite Sets (HFS) to 
Natural Numbers (Nat): 

fix) = if a; = {} then else J^aex 2^'^"^ 

Let us note that Ackermann's encoding can be seen as 
the recursive application of a bijection set2nat from finite 
subsets of Nat to Nat, that associates to a set of (distinct!) 
natural numbers a (unique!) natural number 

A simple change to Ackermann's mapping, will accomo- 
date a finite number of Urelements in [0..u — 1], as follows: 

fu{x) = ±f X < u then x else u + J^aex 2^"^"^ 

Proposition 1 . For u G Nat the function fu is a bijection 
from Nat to HFS with Urelements in [0..u — 1]. 



The proof follows from the fact that no sets map to val- 
ues smaller than ulimit and that Urelements map into them- 
selves. 

With this representation, Ackermann's encoding from 
i/FS* with Urelements in [0. .ulimit-1] to iVai hf s2nat_ 
becomes: 

hfs2nat (U n) = n 

hfs2nat_ ulimit (S es) — 

ulimit + set2nat (map (hfs2nat_ ulimit) es) 

set2nat ns — sum (map (2~) ns) 

where set2nat maps a set of exponents of 2 to the associ- 
ated sum of powers of 2. 
We can now define 

hfs2nat = hfs2nat_ urelement_limit 
urelement_limit=0 

where the constant urelement_limit controls the initial 
segment of Nat to be mapped to Urelements. Note that to 
keep our Haskell code as simple as possible we assume that 
urelement_limit is a global parameter that implicitly fixes 
the set of Urelements. 

To obtain the inverse of the Ackerman encoding, let's first 
define the inverse nat2set of the bijection set2nat. It de- 
composes a natural number into a list of exponents of 2 (seen 
as bit positions equaling 1 in its bitstring representation, in 
increasing order). 

nat2set n = nat2right_exps n where 
nat2right_exps _ = [] 

nat2right_exps n e = add_rexp (n 'mod' 2) e 
(nat2right_exps (n 'div' 2) (efl)) where 
add_rexp _ es = es 
add_rexp 1 e es = (e:es) 

nat2set 42 

[1,3,5] 
set2nat [1,3,5] 

42 

nat2set 2008 

[3,4,6,7,8,9,10] 
set2nat [3,4,6,7,8,9,10] 

2008 

The inverse of the (bijective) Ackermann encoding (general- 
ized to work with urelements in [0 . . ulimit-1] ) is defined 
as follows: 

nat2hf s_ ulimit n | n<ulimit = U n 
nat2hf s_ ulimit n = 

S (map (nat2hfs_ ulimit) (nat2set (n-ulimit))) 

We can now define 

nat2hf s = nat2hf s_ urelement_limit 

where the constant urelement_limit controls the initial 
segment of Nat to be mapped to Urelements. 



As both nat2hfs and hfs2nat are obtained through 
recursive compositions of iiat2set and set2nat, respec- 
tively, one can generalize the encoding mechanism by re- 
placing these building blocks with other bijections with sim- 
ilar properties. 

One can try out nat2hf s and its inverse hf s2nat and 
print out a HFS with the setShow function (given in Ap- 
pendix): 

nat2hfs 42 

S [S [U 0],S [U 0,S [U 0]],S [U 0,S [S [U 0]]]] 
hfs2nat (nat2hfs 42) 
42 

setShow 42 

" {{{}} , {{} , {{»} , {{} , {{{}}}»" 

Assuming urelemeiit_limit=3 the HFS representation be- 
comes: 

nat2hfs 42 

S [U 0,U 1,U 2,S [U 1]] 
setShow 42 

"{0,1,2,{1}}" 

Note that setShow n will build a string representation of 
n e Nat, "shapeshifted" as a HFS with Urelements. Figure 
[T]shows directed acyclic graphs obtained by merging shared 
nodes in the rose tree representation of the HFS associated 
to a natural number (with arrows pointing from sets to their 
elements). 




Figure 1 : Hereditarily Finite Set associated to 42 

3.2 Combinatorial Generation as Iteration 

Using the inverse of Ackermann's encoding, the infinite 
stream HFS can be generated simply by iterating over the 
infinite stream [0 . . ] : 

iterative_hf s_generator = map nat2hfs [0. .] 



take 5 iterative_hf s_generator 
[U , S [U 0] , S [S [U 0] ] , 
S [U 0,S [U 0]] ,S [S [S [U 0]]]] 

3.3 Generating the Stream of Hereditarily Finite Sets 
Directly 

To fully appreciate the elegance and simplicity of the combi- 
natorial generation mechanism described previously, we will 
also provide a "hand-crafted" recursive generator for HFS. 
The reader will notice that this uses some fairly high level 
Haskell constructs like list comprehensions and lazy evalu- 
ation, and that in a language without such features the algo- 
rithm might get significantly more intricate. 

If P{x) denotes the powerset of .t, the Universe of Hered- 
itarily Finite Sets HFS is constructed inductively as fol- 
lows: 

1. the empty set {} is in HFS 

2. if X is in HFS then the union of its power sets P^{x) is 
in HFS 

To implement in Haskell a simple HFS generator, conform- 
ing this definition, we start with a powerset function, work- 
ing with sets represented as lists: 

list_subsets [] — [[]] 
list_subsets (x:xs) = 

[zs I ys^list_subsets xs,zs^ [ys, (x:ys)]] 

We can generate the infinite stream of "pure" hereditarily 
finite sets using Haskell's lazy evaluation mechanism, as 
follows: 

hf s_generator = uhfs_from where 

uhfs_from k = union (old_hfs k) (uhfs_from (k+1)) 

old_hfs k = elements_of (hpow k (U 0)) 
elements_of (U _) = [] 
elements_of (S hs) — hs 

hpow h = h 

hpow k h = hpow (k-1) (S (hsubsets h) ) 
hsubsets (U n) = [] 

hsubsets (S hs) = (map S (list_subsets hs)) 

One can now extract a finite number of HFS from the 
stream 

take 5 hf s_generator 
[S [],S [S []],S [S [S []]], 
S [S [],S [S []]],S [S [S [S []]]]] 

and notice the identical behavior of hf s_generator and 
iterative Jifs_generat or. 

3.4 Encoding Hypergraphs 

Definition 1. A hypergraph (also called set systemj is a 
pair H — {X, E) where X is a set and E is a set of non- 
empty subsets of X. 



By limiting recursion to one level in Ackermann's encoding, 
we can derive a bijective encoding of hypergraphs, repre- 
sented as sets of sets: 

nat2hypergraph — (map nat2set) . nat2set 
hypergraph2nat — set2nat . (map set2nat) 

as shown in the following example: 

nat2hypergraph 2008 

[[0,1] , [2] , [1,2] , [0,1,2] , [3] , [0,3] , [1,3]] 
hypergraph2nat (iiat2hypergraph 2008) 

2008 

As in the case of HFS combinatorial generation of the 
infinite stream of hypergraphs becomes simply 

map nat2hypergraph [0 . . ] 

Note also that a hypothetical application using integers, fi- 
nite sets and hypergraphs can use internally the same im- 
mutable data type, with opportunities to share common 
structures. 

In the following sections we will think about Acker- 
mann's encoding and its inverse as Functors in Category 
Theory ( Pierce|1991) l, transporting various operations from 
Natural Numbers to Hereditarily Finite Sets and back. 

4. Shapeshifting Operations between Nat 

and HFS 
4.1 Fold operators and functors 

Given the rose tree structure of HFS, a natural fold oper- 
ation ( |Nipkow and Paulson 2005 j can be defined on them as 
a higher order Haskell function: 

hfold f g (U n) = g n 

hfold f g (S xs) = f (map (hfold f g) xs) 

For instance, it can count how many sets occur in a given 
HFS, as follows: 

hsize = hfold f g where 
f xs = l+(sum xs) 
g - =1 

Note that recursing over nat2set has been used to build a 
member of HFS from a member of Nat. Thus, we can com- 
bine it with the action of a fold operator working directly 
on natural numbers as follows: 

nfold f g n = nf old_ f g urelement_limit n 

nfold_ f g ulimit n | n<ulimit = g n 
nfold_ f g ulimit n = 

f (map (nfold_ f g ulimit) (nat2set n)) 

For instance, nfold allows counting the elements contained 
in the HFS representation of a number: 

nsize = nfold f g where 
f xs = l+(sum xs) 
E _ =1 



as if defined by 

nsize_alt n = hsize (nat2hf s n) 

The action of the Ackermann encoding as a Functor from 
HFS to Nat on morphisms (seen as functions on a list of 
arguments) is defined as follows: 

toNat f = nat2hf s . f . (map hf s2nat) 

The same, acting on 1 and 2 argument operations is: 

toNatl f i = nat2hfs (f (hfs2nat i)) 

toNat2 f i j = nat2hf s (f (hfs2nat i) (hfs2nat j)) 

The inverse Ackermann encoding acts as a Functor from 
Nat to HFS: 

toHFS f = hf s2nat . f . (map nat2hf s) 

with variants acting on a 1 and 2 argument functions: 

toHFSl f X = hfs2nat (f (nat2hfs x) ) 

toHFS2 f X y = hfs2nat (f (nat2hfs x) (nat2hfs x)) 

Note that the nat2set and set2nat functions used in the 
Ackerman encoding and its inverse can also be seen as pro- 
viding Functors connecting Nat and [Nat] (seen as a repre- 
sentation of finite subsets of Nat): 

toExps f = set2nat . f . (map nat2set) 
fromExps f — nat2set . f . (map set2nat) 

4.2 Mappings between Arithmetic and Set Operations 

After extending 2 argument set operations to lists, using 
foldl 

setOp f []=[] 

setOp f (x:xs) = foldl f x xs 

we can define the equivalent of adduction (i.e. {{] U s - 
see ( |Kaye and Wong|2007[|Kirby|2007| l), union, intersection 
etc., on natural numbers seen as (Usts of) sets: 

nat_adduction i is = 

set2nat (union [i] (nat2set is)) 

nat_singleton i — 2"! 

nat_intersect = nats_intersect . nat2set 
nats_intersect = toExps (setOp intersect) 

nat_union = nats_union . nat2set 
nats_union — toExps (setOp union) 

nat_equal i j = if i=j then 1 else 

Similarly, we can transport from Nat to HFS, opera- 
tions like successor, addition, product, equality as follows: 

hsucc — toNatl succ 
hsum — toNat sum 
hproduct = toNat product 
hequal = toNat2 nat_equal 
hexp2 = toNatl (2") 



with the practical idea in mind that one can pick the most 
efficient (or the simpler to implement) of the two represen- 
tations at will. 

As current computer architectures tend to support Natu- 
ral Numbers and underlying arbitrary integer representations 
quite well, we can pick them as the hub that mediates the 
"shapeshiftings" between various data types. However, in an 
application where lazy structure building would be instru- 
mental for performance, something like HFS (or one of the 
encodings described in the next sections) could be the most 
appropriate internal representation. 

5. Pairing Functions 

Pairings are bijective functions Nat x Nat Nat. Fol- 
lowing the classic notation for pairings of ( |Robinson|1950| l, 
given the pairing function J, its left and right inverses K and 
L are such that 



J{K{z),L{z)) = z 



K{J{x,y))^x 



(1) 



(2) 



(3) 



L{J{x,y)) = y 

We refer to ( Cegielski and Richard 20011) for a typical use 
in the foundations of mathematics and to ( Rosenberg 2002jl 
for an extensive study of various pairing functions and their 
computational properties. 

On top of the "set operations" defined in subsection 
4.2 on Nat, the classic Kuratowski ordered pair (a, b) = 
{{a}, {a, b}} can be implemented with adductions and sin- 
gletons as follows: 

nat_kpair x y = nat_adduction sx ssxy where 
sx — nat_singleton x 
sy — nat_singleton y 
sxy = nat_adduction x sy 
ssxy = nat_singleton sxy 

However, the Kuratowski pair only provides an injective 
function Nat x Nat Nat, resulting in fast growing 
integers very quickly: 

[nat_kpair x y I x<- [0 . . 3] , y<- [0 . . 3] ] 
[2,10,34,514,12,4,68,1028,48, 
80 , 16 , 41 12 , 768 , 1280 , 4352 , 256] 

5.1 Cantor's Pairing Function 

We can do better by borrowing some interesting pairing 
functions defined on natural numbers. Starting from Can- 
tor's pairing function bijections from Nat x Nat to Nat 
have been used for various proofs and constructions of math- 



ematical objects (Robinson 1950 1955 il968a,b, ,Cegielski 



land Richard|200Tr 

Cantor's pairing function is defined as: 

nat_cpair x y = (x+y) * (x+y+1) 'div' 2+y 



Note that its range is more compact 

[nat_cpair i j I i<-[0. .3] , j<-[0. .3]] 

[0,2,5,9,1,4,8,13,3,7,12,18,6,11,17,24] 

Unfortunately, its inverse involves floating point operations 
that do not combine well with arbitrary length integers. 

5.2 A new Pairing Function 

We will introduce here a new pairing function, that provides 
compact representations for various set theoretic constructs 
involving ordered pairs while only using elementary integer 
arithmetic operations. 

Our bijection bitmerge_pair from Nat x Nat to Nat 
and its inverse to_pair are defined as follows: 

bitmerge_pair (i,j) = 

set2nat ((evens i) -(-f (odds j)) where 
evens x = map (*2) (nat2set x) 
odds y — map succ (evens y) 

bitmerge_unpair n = (f xs,f ys) where 
(xs.ys) = partition even (nat2set n) 
f = set2nat . (map ('div' 2)) 

The function bitmerge_pair works by splitting a number's 
big endian bitstring representation into odd and even bits 
while its inverse to_pair blends the odd and even bits back 
together With help of the function to_rbits given in Ap- 
pendix, that decomposes n E Nat into a list of bits (smaller 
units first) on can follow what happens, step by step: 

to_rbits 2008 

[0,0,0,1, 1,0,1,1, 1,1,1] 
bitmerge_unpair 2008 

(60,26) 
to_rbits 60 

[0,0, 1,1, 1,1] 
to_rbits 26 

[0,1, 0,1, 1] 
bitmerge_pair (60,26) 

2008 

Note also the significantly more compact packing, com- 
pared to Kuratowski pairs, and, like Cantor's pairing func- 
tion, similar growth in both arguments: 

map bitmerge_unpair [0..15] 

[(0,0), (1,0), (0,1), (1,1), (2,0), (3,0), 

(2.1) , (3,1), (0,2), (1,2), (0,3), (1,3), 

(2. 2) , (3, 2), (2, 3), (3, 3)] 
[bitmerge_pair (i,j) |i<-[0. . 3] , j<- [0 . . 3] ] 

[0,2,8,10,1,3,9,11,4,6,12,14,5,7,13,15] 

5.3 Powersets, Ordinals and Choice Functions 

A concept of (finite) powerset can be associated to a number 
n G Nat by computing the powerset of the HFS associated 
to it: 

nat_powset i — set2nat 



(map set2nat (list_subsets (nat2set i))) 
or, directly, as in ( Abian and Lamacchia|1978) : 



nat_powset_alt i = product 

(map (Ak^l-|-2"(2~k)) (nat2set i)) 

The von Neumann ordinal associated to a HFS, defined 
with interval notation as A = [0, A), is implemented by the 
function hf s_ordinal, simply by transporting it from Nat: 

nat_ordinal 0=0 
nat_ordinal n = 

set2nat (map nat_ordinal [0..(n-l)]) 

hfs_ordinal = nat2hfs . nat_ordinal 

The following example shows the transitive structure of a 
von Neumann ordinal's set representation (see Fig.|2]i. It also 
shows its fast growing Nat encoding (4 2059) which can 
be seen as a somewhat unusual injective embedding of finite 
ordinals in Nat, seen as the set of finite cardinals. 

hfs_ordinal 4 

S [S [] ,S [S []] ,S [S [] , 

S [S []]],S [S [],S [S []], 
S [S [],S [S []]]]] 
nat_ordinal 4 
2059 

Finally, a choice function is implemented as an encoding 
of pairs of sets and their first elements with our compact 
Nat X Nat Nat pairing function: 

nat_choice_f un i = set2nat xs where 
es — nat2set i 

hs — map (head . nat2set) es 

xs = zipWith (curry bitmerge_pair) es hs 

As even numbers represent sets that do not contain the empty 
set as an element, we compute Nat representations of the 
choice function as follows: 

map nat_choice_f un [0,2.. 16] 

[0,2,64,66,32,34,96,98,16777216] 

Note that iiat_choice_f unction computes a natural num- 
ber representation i.e. Goedel number for a function that 
picks an element of each set of any family of sets not con- 
taining the empty set. Constructing such a natural number 
proves that Nat, with the structure borrowed from HFS is 
actually a model for the Axiom of Choice. Such models are 
important in the foundations of mathematics as they show 
that interpretations of sets and functions other the usual ones 
are compatible with various axiomatizations of set theory 
( Kaye and Wong|2007|[Kjrby|2007| l. 



6. Directed Graph Encodings 

Directed Graphs are equivalent to binary relations seen as 
sets of ordered pairs. Equivalently, (as implemented in the 
Haskell Data. Graph package), they can also be seen as 



Haskell ordered pairs. The following examples show how 

this works: 



a 




i 




Figure 2: 4 and its associated ordinal: as a pure HFS and 
its associated ordinal 2059 



arrays of vertices in [0..n] paired with lists of vertices of 
adjacent outgoing edges. We will freely alternate between 
these two representations in this section. 

6.1 Directed Acyclic Graph representations for HFS 

The rose tree representation of HFS can be seen as a set 
of edges, oriented to describe either set membership e or its 
transpose, set contaiimient. 

rLat2memb = nat2pairs with_memb 

nat 2 contains = nat2pairs with.contains 

with_memb a x = (x,a) 
with_contains a x = (a,x) 

Note that this uses the function iiat2pairs (see Appendix) 
that provides the actual decomposition of a number into 



nat2memb 42 

[(0,1), (0,3), (0,5), (1,2), (1,3), 
(1,42), (2, 5), (3, 42), (5, 42)] 
nat2coiitains 42 

[(1,0), (2,1), (3,0), (3,1), (5,0), 
(5, 2), (42,1), (42, 3), (42, 5)] 

These hst of pair representations can be easily converted to 
Haskell's graph data type (imported from Data.Graph) as 
follows: 

nat2member_clag = nat2dag_ nat2memb 
nat2contains_dag = nat2dag_ nat2contains 

nat2dag_ f n = buildG (0,1) es where 

es==reverse (f n) 

l^^oldl max (nat2parts n) 

where nat2parts, given in the Appendix, converts n to the 
set of Natural Numbers occurring in its HFS representation. 

Moreover, the pair representation of G and its inverse 
can be turned into a more compact graph by replacing its n 
distinct vertex numbers with smaller integers from [0..n— 1], 
by progressively building a map describing this association, 
as shown in the function to_dag 

to_dag n = (buildG (0,1) 

(map (remap m) (nat2contains n))) where 
is = [0. .1] 

ns = reverse (nat2parts n) 

1 = (genericLength ns)-l 
iit= (zip ns is) 

remap m (f,t) = (If, It) where 
(Just lf)=(lookup f m) 
(Just lt)=(lookup t m) 

Dually, one can convert n G Nat to the containment 
graph of its HFS as follows 

to_ddag = transposeG . to_dag 

An interesting question arises at this point. Can we re- 
build a natural number from its directed acyclic graph rep- 
resentation, assuming no labels are available, except 0? Sur- 
prisingly, the answer is yes, and the function f rom_dag pro- 
vides the conversion: 

from_dag g = 

compute_decoration g (fst (bounds g)) 

compute_decoration g v = 

compute_decorations g (g!v) where 
compute_decorations _ [] = 
compute_decorations g es = 

sum (map ((2~) . (compute.decoration g) ) es) 

to_dag 42 

array (0,5) [(0, [1 ,2,4] ) , (1 , [3,5] ) , (2, [4,5] ) , 



(3,[4]),(4,[5]),(5,[])] 
from_dag (to_dag 42) 
42 

to_ddag 42 

array (0,5) [(0, [] ) , (1 , [0] ) , (2, [0] ) , (3, [1] ) , 
(4, [3, 2,0]), (5, [4,2,1])] 

After implementing this function, we have found that it 
closely follows the decoration functions used in Aczel's 
book ( Aczel|1988 i, and renamed it compute_decoration. 
In the simpler case of the HFS universe, with our well- 
founded sets represented as DAGs, the existence and unicity 
of the result computed by f rom_dag follows immediately 
from the Mostowski Collapsing Lemma (( Aczel|1988| l). 



6.2 Extensional/Intensional Duality 

What can be said about the graphs obtained by reversing 
the direction of the arrows representing the £ relation? 
Intuitively, it corresponds to the fact that intensions/con- 
cepts would become the building blocks of the theory, pro- 
vided that something similar to the axiom of extensionality 
holds. In comments related to Russell's type theory (Goedel] 
1999 1 pp. 457-458 Godel mentions an axiom of intension- 



ality with the intuitive meaning that "different definitions 
belong to different notions". Godel also notices the dual- 
ity between "no two different properties belong to exactly 
the same things" and "no two different things have exactly 
the same properties" but warns that contradictions in a sim- 
ple type theory would result if such an axiom is used non- 
cons tructively. 

We can now look for the presence of intensional/exten- 
sional symmetry in HFS by trying to rebuild a HFS repre- 
sentation from the transpose of e, 3: 



from_ddag g = 

compute_decoration g (snd (bounds 



)) 



intensional_dual_of = from_ddag . to_ddag 

Are such representations self-dual? Let's define as self-dual 
a number n G Nat that equals its intensional dual and then 
filter self-dual numbers in an interval: 

self_idual n = n=intensional_dual_of n 

self_iduals from to = 

filter self_idual [from.. to] 

Unfortunately, as the following example shows, relatively 
few numbers are self-duals: 

self_iduals 1000 

[0,1,2,3,4,5,10,11,16,17,34,35, 
64 , 65 , 130 , 131 , 264 , 265 , 522 , 523] 

Figures|3]and|4]show some HFS graphs of natural numbers 
equal to their intensional duals. 

We will leave it as a topic for future research to investigate 
more in depth, various aspects of e / 9 duality in HFS, in 
correlation with Natural Numbers and their encodings. 





Figure 3: self-dual and its intensional dual as HFS graphs: 
131 and the dual of 131 



6.3 Encodings of Directed Graplis as Natural Numbers 

Hypersets (lAczel 1988 ) are defined by replacing the Founda- 
tion Axiom with the AntiFoundation axiom. Intuitively this 



means that the G-graphs can be cyclical (Barwise and Moss| 



1996 1, provided that they are minimized through bisimu- 



lation equivalence ( [Dovier et al.||200T i. We have not (yet) 
found an elegant encoding of hereditarily finite hypersets as 
natural numbers, similar to Ackerman's encoding. The main 
difficulty seems related to the fact that hypersets are modeled 
in HFS as equivalence classes with respect to bisimulation 
( |Aczel|[T988l [Barwise and Moss|[T996l [Piazza and PoUcriti| 
2004[ l. Toward this end, an easy first step seems to find a bi- 



jection from directed graphs (with no isolated vertices, cor- 
responding to their view as binary relations), to Nat: 

nat2digraph n = map bitmerge_unpair (nat2set n) 





Figure 4: self-dual and its intensional dual as HFS graphs: 
16393 and the dual of 16393 



digraph2nat ps = set2nat (map bitmerge_pair ps) 

With digraphs represented as lists of edges, this bijection 
works as follows: 

nat2digraph 2008 

[(1,1), (2,0), (2,1), (3,1), 
(0,2), (1,2), (0,3)] 
digraph2nat (nat2digraph 2008) 

2008 

nat2digraph (255) 

[(0,0), (1,0), (0,1), (1,1), 

(2,0), (3,0), (2,1), (3,1)] 
digraph2nat (nat2digraph 255) 

255 

As usual 



map nat2digraph [0 . . ] 

provides a combinatorial generator for the infinite stream of 
directed acyclic graphs. 

7. Related work 

Natural Number encodings of Hereditarily Finite Sets have 
triggered the interest of researchers in fields ranging from 
Axiomatic Set Theory and Foundations of Logic to Com- 



and Wong||2007 Kirby 


12007 


Abian and Lamacchia 19781 


Kirby|2007 Meir et al. 


1983 


Leontjev and Sazonov 2000) 


Sazonov|1993 Avigad|1997|l. Graph representations of sets 



and hypersets based on the variants of the Anti Foundation 
Axiom have been studied extensively in ( |Aczel|[T988 Bar- 



|wise and Moss|1996 1. Computational and Data Representa 
tion aspects of Finite Set Theory and hypersets have been de- 
scribed in logic programming and theorem proving contexts 
in (Dovieret alT 2000 ; Piazza a nd PoUcriti|2004||Dovier et al.| 
[200 1 , jPaulson, 1 994 j . Pairing functions have been used work 
on decision problems as early as ( Pepis|1938 Kalmar|1939 



Robinson 1950 1955 1968a'b'). Various mappings from nat- 



ural number encodings to Rational Numbers are described 
in ( [Gibbons et al.|[2006l l, also in a functional programming 
framework. 

8. Conclusion and Future Work 

Implementing with relative ease the encoding techniques 
typically used only in the foundations of mathematics rec- 
ommends Haskell as a surprisingly effective tool for experi- 
mental mathematics. 

We have described a variety of isomorphisms between 
mathematically interesting data structures, all centered around 
encodings as Natural Numbers. The possibility of sharing 
significant common parts of HFS -represented integers could 
be used in implementing shared stores for arbitrary length 
integers. Along the same lines, another application would 
be data compression using some "information theoretically 



minimal" variants of the graphs in subsection 6.1 from 
which larger, HFS and/or natural numbers can be rebuilt. 

Last but not least, making more accessible to computer 
science students some of the encoding techniques typically 
used only in the foundations of mathematics (and related 
reasoning techniques), suggests applications to teaching dis- 
crete mathematics and/or functional languages in the tradi- 
tion of Pall and O'DonnelipOOO) . 
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Appendix 

To make the code in the paper fully self contained, we list 
here some auxiliary functions. 

Bit crunching functions The following functions imple- 
ment conversion operations between bitlists and numbers. 
Note that our bitlists represent binary numbers by selecting 
exponents of 2 in increasing order (i.e. "right to left"). 

— from decimals to binary as list of bits 
to_rbits n — to_base 2 n 

— from bits to decimals 
from_rbits bs = from_base 2 bs 

— conversion to base n, as list of digits 
to_base base n = d : 

(if q=0 then [] else (to_base base q) ) where 
(q,d) — quotRem n base 

— conversion from any base to decimal 
from_base base [] = 

from_base base (x:xs) = x-fbase* (f rom_base base xs) 



String Representations The function setShow provides a 
string representation of a natural number as a "pure" HFS. 

setShow n = sShow urelemeiit_limit n 

The function sShow provides a string representation of a 
natural number as a HFS with Urelements. 

sShow 1 = "{}" 

sShow ulimit n | n<ulimit = show n 
sShow ulimit n = "{"-H- 
foldl (H-I-) "" 

(intersperse "," (map (sShow ulimit) (nat2set (n-ulimit) ) ) ) 
-H-"}" 

Conversion to Ordered Pairs The function nat2pairs 
converts a natural number to a set of Haskell ordered pairs 
expressing the e relation on its associated HFS or its dual 
3. 

nat2pairs withF n = (sort . nub) (nat2ps withF n) 

nat2ps withF = [] 
nat2ps withF from = 

((n2rel ns) -H- (ns2rel ns)) where 
f = withF from 
n2rel = map f 

ns2rel = concatMap (nat2ps withF) 
ns=*at2set from 

The function nat2parts converts n to the set of Natural 
Numbers occurring in the HFS representation of n. 

nat2parts = sort . nub . nat2repeate<i where 
nat2repeated = [0] 

nat2repeated from = from : (nat2more ns) where 
nat2more = concatMap nat2repeated 
ns=^at2set from 



